Accession Number |
TCMCG010C07442 |
gbkey |
CDS |
Protein Id |
XP_016560366.1 |
Location |
join(145030611..145030736,145031553..145031604,145031719..145031766,145032350..145032504,145032594..145032651,145032735..145032792,145032911..145033003,145033079..145033260,145033468..145033664,145039268..145039337,145039579..145041078,145041167..145041296,145041394..145041488,145041752..145041825,145041907..145042116,145042794..145042838,145042908..145042961,145043091..145043197,145043355..145043403,145043732..145043791,145043916..145043988,145044372..145044457,145044580..145044756) |
Gene |
LOC107859776 |
GeneID |
107859776 |
Organism |
Capsicum annuum |
|
|
Length |
1232 aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA319678 |
db_source |
XM_016704880.1 |
Definition |
PREDICTED: DNA mismatch repair protein MLH3 isoform X2 [Capsicum annuum] |
CDS: ATGGGGAGCATTAAGCGAATGCCCGAGGCTATTTGGAGCAGGATGCGGTCAGGGGTTATCTTGTATGATTTCACTAGGATTGTTGAGGAGTTGGTTTTCAACAGCCTCGATGCTGGTGCCACCAAGGTATATGTTGCTATAGGAATTGGAACCTGCTATGTTAAGGTGGATGACAATGGATCTGGTGTTTCACGAGATGGACTGGTGCTGATGGGAGAAAGATATGCGACATCAAAATACAGCAATTCAGATGATATGCATGCTTTTCCCACAAGTTTTGGCTTTAAAGGAGAGGCTCTGAATTCTATTTCTGATGTTTCTTTGTTGGAAATTGTTTCTAAAATTCAGGGGAGGCCAAATGGATATCGTAAGGTTTTGAAGGACGGCAAGTGTTTGTACCTTGGAATTGATGATTCTAGACAAGATGTTGGTACAACAGTCATTGTTCGTGATGCGTTTTACAACCAACCAGTTCGAAGGAAGCAAATGCACTCCAACCCAAAGAGGGTTTTGCATTCTCTGAAAGAGTCTCTGCTAAGAATCGCCCTTGTGCATCCCAGTGTTTCCTTCAAAATTATTGATATTGAAAGTGAGGATGACTTGCTTTGGACACATGCTTCTCCTTCTCCGTTGCCGCTCCTGTCTAGTGGGTTTGGGATTCATCTGAGTTCCCTTAACAAATTGAATGCAAGTGCTGGTTCATTCAAGCTCTCAGGATACATTTCAGGTCCTGATGTTTACACTGTGAAGGTATATCGGCCCAAATTGATGGATATCAATTCAAGATTTGTTTCCAAAGGACCAATACATAAATTACTTAATAACCTAGCGATGAGTTTTGACAGTGCTTCTGACATTGAGCAGCGAGGTAGATCTCAGAGAAATCCACTGTTCTTTTTAAACCTAAACTGCCCAAGATCTTTGTATGATTTGACTTTGGAGCCTTCAAAGACCTCTGTGGAATTTAAGGATTGGCGCTCCGTCCTTCTCTTTGTTGAGGATACTGTCGTGAATCTCTGGACAGAAAGTAACGCTGCTGATATACCTGTGAATTATGAGACTGGAAAAAAGAGGAGCAGGGCCCAGAGTTGCAAAGCTACTCTTGAGCTTCCTTCCCCGCAGCCAAAGAAATTGACTGGAGACTGCACTGCCAGGAAAGAAATTCAATCTTCACAGAATATTCCGTGGGAAAGTTCTTCTGAAAAGCATGATCCTGAGTCCAGATTCCTCTGTCAAATGGAAAGTTCAAGTTGGTCAATTGATGGGTCTCTTGCTCATGGTAAAGTCGGTGTGAACTGGAAATCCAGAAGCTCTGTGCAACCCCTTTCATCTAATGTTCTACCTACAGTGGATGATTTCCTGGACAGCAAATTCAATGCTTCAGCTAGCTCTAGTTATAAATCAGACTGCCTGTTTGGTTCAGGATGGGAGGATGAGTCTCAAACAATTGTAGCTGGCAGATCAACAGGGGATGCCTCCTTTAGGAAGTCTTTTGAACTTGATGACAGTTCAAATGTGATGCATGAGAGAAGGGAACCATTTATGCGGAGCTGTTCTTTGCACAGGAGCCTGATACATGATGGAGCATCTTTTGATAGTGATGAAGATATTAAGTTTGAAAAAAGTGACTACAGAACTAAACAAAATTGCCTTGAAGATGATTGTAGTGTTGAATTTGAAGTAGTTGATGATGTTAACCAGGTCTTAAATCAAAGGTCTAACCAGGTCTTAGATCATAGGTCTCCTAGACGCAAAGAAATATATTTTGAGAACTTCTCCAGGTGCAAAACTCCGAGCAAGGCATTGCAGAGGTCAAGATTTTTGTCTGGAGATTCAGAAAAATCTTCCTTAACCAAGGACATTCTAGATGAAGACGATCATCTTATGGACTTCGTTAAACAGACTGAAAATTATGGTTCTGGCCTGTTGTCTTTTAGTCCAGACCCGTCTCCTCTGCCACCAGATCCTTTTCTCGGGACCAGGTTTCAAGTTGTT
AATCCTTACATCGCTGAAAATGGGATTGAAACTTCTGCTAAACATGAATTTGATTTCATGTATAATTTTGGAAATATGGAACATAATATCTTGGTTCCTGCCATAAGTAATTGGGAAAAAGAGGACTGCTTCTTCCCAGATCCTGCAAAGTTTGATCTCAATTTTAATGCTTGTTCTAGAGAGGATATGGGCAGTATAGGGGGACTTGATTCGTGGGACGTTTGTAATTCAGGTCCTTCTGAATTCCATTATGATGGAGATGATTTGTCACGTATACATTCTCATGGTGAAGAAAATCTTAATAATTCTTTGATTCCACGAGCTATGCTTTCCTCTAGAGTGGATGTGGACTCGCATAAATGGATTGATGCTGGAAATCAAGGTAAAACAGATGAGCCTCTAAGGAAGAAGAAAAAGACAAGAAGTCATTCGGCTCCTCCATTTTACCAAGGCAAGAAGTTCTTTGCCACTGGTGAGTCTTCAAGAAAGGCAGCAGGAAATAACAATATTAAGACTGTTCATGATGTGCCACTCATCATGCCAGAAACAAGGGGTGTAAGGAGATTGCAACATTCTGCAGAAGCTATCTGCTCGGAGCTTCCCCAGCAGTCATCCAATCAATGTGATCTATCTTCGACCCCAAGTTATGGTGACAGTGTATTCTTTGATGAAAGGCCAAGTGTGAAAACGAAACTTGTTAATATCTGGAATAGCAATTTACAAACTCAAGGGGAGTGCACAAGCACACACGATAAGGAGTCAAATGAAGAATTTGCACCAACAAAAACTCAAAGTATCTTAGATTCTGGGACAAAATGGAGGGACTTCTGTCCAGAGGTTACATATGAGATGCAGAAGAGTACCGGAACAGAGAATCTTAAGAATCAGGATACTATACTCAATGTCACTTCTGGCATCTTGCATATGGTTGGTGATTCATTGATTCCTGATACCATTGATAAAAACTGCCTGGAGGGTGCCAAAGTTCTCCAACAGGTTGATAAGAAGTTTATTCCGATTGTGGCAGGCACAACACTTGCTATAATTGATCAGCATGCTGCAGATGAGCGAATTTGTTTGGAAGAACTGCGTGATAAGGTTCTATCTGGACAAAAGAGGACAACAACCTATCTTGATTCCGAGCAAGAATTGGTCATGCCTGAAATTGGTCACCAATTACTACTCAACTATGCTGACCAAATTCAAAACTGGGGTTGGATCTGCAATATTCATTCTCAAGCTTCAAGATCATTTTCCAGGAACTTGAATCTGATTCACAAGCAGCCAACATCTGTCACACTTCTTGCGGTTCCGTGTATTTTGGGCGTTAATCTAACTGATGTGGATCTATTAGAATTTCTTCAACAGCTTGCTGATACAGATGGATCATCAATCGTACCACCATCAGTGAATCGGGTCCTGAATAACAAGGCTTGCAGGAGTGCAATTATGTTTGGAGATGCATTGTTGCCTTCAGAATGTTCTCTCATTGTTGAGGAGTTGAAGCAGACTTCATTGTGTTTTCAATGTGCTCATGGACGACCGACTACTGTCCCTCTTGTCAACTTGCGTGCTCTGCATGACCAGATTGCTAAGTTAGGTTCGTTGAGTAGGGGTTCCTCCATAACATGGCATGGATTACTACATCGGCGTGAAATCAACCTAGAGCGTGCAGCAGAGCGACTAAAATCAGCCGCATCCTAG |
Protein: MGSIKRMPEAIWSRMRSGVILYDFTRIVEELVFNSLDAGATKVYVAIGIGTCYVKVDDNGSGVSRDGLVLMGERYATSKYSNSDDMHAFPTSFGFKGEALNSISDVSLLEIVSKIQGRPNGYRKVLKDGKCLYLGIDDSRQDVGTTVIVRDAFYNQPVRRKQMHSNPKRVLHSLKESLLRIALVHPSVSFKIIDIESEDDLLWTHASPSPLPLLSSGFGIHLSSLNKLNASAGSFKLSGYISGPDVYTVKVYRPKLMDINSRFVSKGPIHKLLNNLAMSFDSASDIEQRGRSQRNPLFFLNLNCPRSLYDLTLEPSKTSVEFKDWRSVLLFVEDTVVNLWTESNAADIPVNYETGKKRSRAQSCKATLELPSPQPKKLTGDCTARKEIQSSQNIPWESSSEKHDPESRFLCQMESSSWSIDGSLAHGKVGVNWKSRSSVQPLSSNVLPTVDDFLDSKFNASASSSYKSDCLFGSGWEDESQTIVAGRSTGDASFRKSFELDDSSNVMHERREPFMRSCSLHRSLIHDGASFDSDEDIKFEKSDYRTKQNCLEDDCSVEFEVVDDVNQVLNQRSNQVLDHRSPRRKEIYFENFSRCKTPSKALQRSRFLSGDSEKSSLTKDILDEDDHLMDFVKQTENYGSGLLSFSPDPSPLPPDPFLGTRFQVVNPYIAENGIETSAKHEFDFMYNFGNMEHNILVPAISNWEKEDCFFPDPAKFDLNFNACSREDMGSIGGLDSWDVCNSGPSEFHYDGDDLSRIHSHGEENLNNSLIPRAMLSSRVDVDSHKWIDAGNQGKTDEPLRKKKKTRSHSAPPFYQGKKFFATGESSRKAAGNNNIKTVHDVPLIMPETRGVRRLQHSAEAICSELPQQSSNQCDLSSTPSYGDSVFFDERPSVKTKLVNIWNSNLQTQGECTSTHDKESNEEFAPTKTQSILDSGTKWRDFCPEVTYEMQKSTGTENLKNQDTILNVTSGILHMVGDSLIPDTIDKNCLEGAKVLQQVDKKFIPIVAGTTLAIIDQHAADERICLEELRDKVLSGQKRTTTYLDSEQELVMPEIGHQLLLNYADQIQNWGWICNIHSQASRSFSRNLNLIHKQPTSVTLLAVPCILGVNLTDVDLLEFLQQLADTDGSSIVPPSVNRVLNNKACRSAIMFGDALLPSECSLIVEELKQTSLCFQCAHGRPTTVPLVNLRALHDQIAKLGSLSRGSSITWHGLLHRREINLERAAERLKSAAS |
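
The Location field above lists the CDS as a `join(...)` of genomic segments, and the Length field gives the protein size. A minimal sketch of how the two can be cross-checked follows. It assumes a plain forward-strand `join` with no `complement(...)` wrapper and no partial markers (`<`, `>`), and the helper names `parse_join` and `cds_length` are hypothetical, introduced only for this illustration.

```python
import re

# Location string copied from the record's "Location" field.
LOCATION = (
    "join(145030611..145030736,145031553..145031604,145031719..145031766,"
    "145032350..145032504,145032594..145032651,145032735..145032792,"
    "145032911..145033003,145033079..145033260,145033468..145033664,"
    "145039268..145039337,145039579..145041078,145041167..145041296,"
    "145041394..145041488,145041752..145041825,145041907..145042116,"
    "145042794..145042838,145042908..145042961,145043091..145043197,"
    "145043355..145043403,145043732..145043791,145043916..145043988,"
    "145044372..145044457,145044580..145044756)"
)

# Protein length copied from the record's "Length" field.
PROTEIN_LENGTH_AA = 1232


def parse_join(location: str) -> list[tuple[int, int]]:
    """Return (start, end) pairs (1-based, inclusive) from a simple join(...) string.

    Assumption: only plain "start..end" segments occur, as in this record.
    """
    return [(int(start), int(end)) for start, end in re.findall(r"(\d+)\.\.(\d+)", location)]


def cds_length(segments: list[tuple[int, int]]) -> int:
    """Sum the segment lengths; both coordinates are inclusive, hence the +1."""
    return sum(end - start + 1 for start, end in segments)


segments = parse_join(LOCATION)
total_nt = cds_length(segments)

# A complete CDS encodes one codon per residue plus one stop codon.
expected_nt = (PROTEIN_LENGTH_AA + 1) * 3

print(f"segments: {len(segments)}")
print(f"CDS length from Location: {total_nt} nt")
print(f"CDS length expected from protein: {expected_nt} nt")
print(f"consistent: {total_nt == expected_nt}")
```

For this record the 23 segments sum to 3699 nt, which matches 1232 codons plus a stop codon. The same check could be applied to the CDS field itself by comparing its character count against `total_nt`.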